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• Premise of the study: Microsatellite primers were developed for Geranium carolinanum, sl North American winter annual herb, 
for use in population genetic analyses. 

• Methods and Results: Genomic DNA enriched for repeat-containing fragments was sequenced on the Roche 454 Titanium 
platform, resulting in 470 primer pairs developed from 1115 microsatellite-containing sequences. A subset of 37 primer pairs 
was screened for polymorphism across three native and three invasive populations. We identified four monomorphic and eight 
polymorphic loci. Polymorphic loci contained between two and seven alleles per locus, and mean within-population expected 
heterozygosity ranged from 0.100 to 0.290. Within populations, observed heterozygosity for individual loci ranged from zero 
to 0.857, and expected heterozygosity ranged from 0.046 to 0.559. 

• Conclusions: These microsatellite markers will be useful for future studies of genetic diversity, structure, and mating systems 
across the geographic range of G. carolinianum, and may be transferable to other closely related species. 
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Geranium carolinianum L. (Geraniaceae) is a weedy winter 
annual herb native to North America and naturalized in East 
Asia, South America, and the Caribbean (Aedo, 2000). In 
China, it is naturalized along roadsides and in agricultural 
fields, and is considered an invasive species with minor envi- 
ronmental impacts (Liu et al., 2006). The origin and colonization 
history of these introduced populations is unknown. Although 
it is not closely related to the horticultural geranium (genus 
Pelargonium L'Her. ex Aiton), genetic resources developed 
for G. carolinianum are potentially transferable to other Gera- 
nium L. species, which includes annuals and perennials that 
are both native and introduced in North America (Aedo, 2000). 
Here, we present 12 microsatellite loci developed using targeted 
enrichment and next-generation sequencing. 

METHODS AND RESULTS 

DNA was extracted from leaf tissue using a QIAGEN Plant Mini Kit 
(QIAGEN, Valencia, California, USA) and enriched for microsatellites using 
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three probe mixes [Mix 2 = (AG) 12 , (TG) 12 , (AAC) 6 , (AAG) 8 , (AAT) 12 , (ACT) 12 , 
(ATC) 8 ; Mix 3 = (AAAC) 6 , (AAAG) 6 , (AATC) 6 , (AATG) 6 , (ACAG) 6 , (ACCT) 6 , 
(ACTC) 6 , (ACTG) 6 ; Mix 4 = (AAAT) 8 , (AACT) 8 , (AAGT) 8 , (ACAT) 8 , 
(AGAT) 8 ] as in Glenn and Schable (2005). DNA from one individual was di- 
gested separately with Rsa\ (New England Biolabs, Ipswich, Massachusetts, 
USA) and Alul (New England Biolabs). Digests were pooled, a single 
adenosine was added with exo-Klenow fragment (New England Biolabs), 
and fragments were ligated to CAG-SimpleXT-13 adapters (upper oligo = 
GTTTC AGTCGGGCGTCATCA CCAGCACCGGAACAT; lower oligo = /5Phos/ 
TGTTCCGGTGCTGG). Linker-ligated DNA was pooled, denatured, and hy- 
bridized to biotinylated microsatellite oligonucleotide mixes, which were sub- 
sequently captured on magnetic streptavidin beads (Dynabeads M-280; 
Invitrogen, Carlsbad, California, USA), and remaining DNA was washed away. 
The enriched DNA was eluted and amplified in PCR using Phos_Pig_CAG 
(/5Phos/GTTTC AGTCGGGCGTCATCA). PCR reactions were run in a 25 uL 
total volume with 2 uL eluted DNA, 0.2 uL Phusion Taq polymerase (5 units/uL; 
New England Biolabs), lx Phusion reaction buffer (New England Biolabs), 
0.025 mg/mL bovine serum albumin (BSA; New England Biolabs), 2.0 mM 
MgCl 2 , 0.15 mM each dNTP (Sigma-Aldrich, St. Louis, Missouri, USA), and 
0.5 uM Phos_Pig_CAG. Cycling conditions were 2 min at 95°C; 25 cycles of 
20 s at 95°C, 20 s at 60°C, and 1 .5 min at 72°C; and a final 30-min extension at 
72°C. Multiple libraries with additional different Simplex sequences were 
then pooled and ligated into a 454 RL MID-tagged library, pooled with addi- 
tional 454 RL MID-tagged libraries, and sequenced on half a run of the Roche 
454 FLX with Titanium chemistry using the manufacturer's protocols. 

Following demultiplexing, 1115 sequences with microsatellites were identi- 
fied from 5586 sequence reads using MSATCOMMANDER version 0.8.1 
(Faircloth, 2008). All microsatellite-containing sequences are provided in 
Appendix S 1 . MSATCOMMANDER also designs primers for microsatellite- 
containing sequences using Primer3 (Rozen and Skaletsky, 2000). Sequences 
were discarded if Primer3 could not design acceptable primers. All primers 
(excluding those tested as described below, a total of 433 pairs) are reported in 
Appendix S2. To avoid selecting primers within highly repetitive regions of the 
genome, sequences were assembled into contigs by automatic assembly using 
default parameters in Sequencher version 4.7 (Gene Codes Corporation, Ann 
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Arbor, Michigan, USA). Microsatellite-containing sequences that were at high 
frequency (i.e., members of contigs with >10 sequences) were not considered 
for screening. 

Of the remaining sequences, we selected 37 primer pairs for testing. The 
majority of primers tested were based on unique sequences in the data set 
and amplified tri- or tetranucleotide repeats. Forward primers were tagged 
on the 5' end with the universal "CAG" sequence (CAGTCGGGCGT- 
CATCA) to allow fluorescent labeling of PCR product using a 3 -primer pro- 
tocol as detailed below. Additionally, a 5'GTTT "pigtail" was added to some 
reverse primers to ensure consistency in amplicon size (Brownstein et al., 
1996). These 37 primer pairs were screened using DNA template from 12 
samples across G. carolinianum 's native range. All primer pairs were ini- 
tially tested using a "touchdown 58" PCR protocol. Cycling conditions 
were 3 min at 95°C; nine touchdown cycles of 30 s at 95°C, 30 s at 65-58°C 
(starting at 65°C and decreasing one degree each cycle), and 45 s at 72°C; 
30 cycles of 30 s at 95°C, 30 s at 50°C, and 45 s at 72°C; and a final exten- 
sion of 20 min at 72°C. Primer pairs that did not amplify consistently at 
touchdown 58 were tested with a "touchdown 50" protocol, which was iden- 
tical to touchdown 58 except the annealing temperatures for the touchdown 
cycles decreased from 58°C to 50°C, followed by 30 cycles at an annealing 
temperature of 50°C. All PCR reactions were carried out in a 12.5 jiL total 
volume with 10-200 ng DNA template, 0.1875 units JumpStart Taq DNA 
polymerase (Sigma- Aldrich), lx JumpStart reaction buffer without magne- 
sium (Sigma- Aldrich), lx (0.1 mg/mL) BSA, 2 mM MgCl 2 , 0.15 mM each 
dNTP, 0.025 jiM forward (CAG-tagged) primer, 0.25 jiM reverse (untagged) 
primer, and 0.25 JiM universal CAG primer labeled with a 5' FAM, HEX, or 
NED fluorescent dye. Primer pairs with consistent amplification at either 
touchdown 58 or touchdown 50 were screened for polymorphism by frag- 
ment analysis. PCR products from the initial 12 DNA samples were sized on 
an ABI 3730x1 capillary sequencer (Applied Biosystems, Carlsbad, California, 
USA) using a ROX-labeled internal size standard (GGF500R; Georgia Genom- 
ics Facility, Athens, Georgia, USA). Chromatograms were analyzed using 
GeneMapper version 3.7 (Applied Biosystems). 

Of the 37 primer pairs tested, we identified eight polymorphic and four 
monomorphic loci with high-quality amplification (Table 1). Primers that 
performed poorly in the screening are listed in Appendix S3. Monomorphic 
loci were tested further against either 48 or 96 individual DNA samples 
from across the native and invasive ranges, but no polymorphism was found. 



Polymorphic loci were used to assess genetic variation in 103 individual 
DNA samples from three native populations in the United States and three 
invasive populations in eastern China (Table 2). For GC7 and GC35, only a 
subset of DNA samples were available for one of the populations from 
Georgia, USA, and the population from Henan, China, so sample sizes from 
other populations were increased to reach a total sample size of roughly 
n = 100 per locus. Voucher specimens from the two Georgia and three 
China populations have been deposited at the University of Georgia Her- 
barium (GA). We were unable to obtain a voucher for the North Carolina 
population. 

We calculated the number of alleles, observed heterozygosity (H 0 ), and 
expected heterozygosity (H e ) for each polymorphic locus and population 
in GenAlEx version 6.2 (Peakall and Smouse, 2006). Results are given in 
Table 2. Across all samples, number of alleles per polymorphic locus ranged 
from two to seven, and five loci were monomorphic in at least one popula- 
tion. Mean population H e (averaged across loci) ranged from 0. 100 to 0.290. 
Within populations, H 0 for individual loci ranged from zero to 0.857, and H e 
ranged from 0.046 to 0.559. We did not test for Hardy-Weinberg equilib- 
rium because this is a colonizing, mixed-mating species. 



CONCLUSIONS 

We report 12 microsatellite loci in G. carolinianum, eight 
of which are shown to be variable in native and invasive 
populations. Forward primer sequences are labeled with a 5' 
universal sequence to allow fluorescent tagging of amplicons 
during PCR, and reverse primer sequences (with the excep- 
tion of GC1-R, GC6-R, GC7-R, and GC10-R) are labeled 
with a 5' GTTT pigtail sequence for consistent genotyping. 
Although not tested, addition of a 5' pigtail to these primers 
should facilitate accurate genotyping. These microsatellite 
loci will be useful for studies of genetic variation, gene flow, 
and mating system in G. carolinianum and may be transfer- 
able to related species. 



Table 1 . Characteristics of microsatellite primers developed in Geranium carolinianum. 











Fragment 








GenBank 


Locus 




Primer sequences (5'-3') 


Repeat motif 


size (bp) 


r a (°o 


Dye a 


accession no. 


GCl b 


F 

R 


CAGTCGGGCGTCATCATTGTGAGCTTGCTCTTGCC 
AAGGCATCCCAACAGAGGG 


(CTT) 4 


203 


65- 


-58 


FAM 


KC433595 


GC6 b 


F 

R 


CAGTCGGGCGTCATCACGAGTTGCAGCTACCAAGC 
TGGAGCCTCTATTGCACCC 


(CTT) 4 


240 


65- 


-58 


HEX 


KC433596 


GC7 


F 

R 


CAGTCGGGCGTCATCATCTCGCTCATCATCACTCTCC 
GAACGAAGCAATCCGCTGG 


(CTT) 18 


217 


58- 


-50 


NED 


KC429676 


GC10 


F 

R 


CAGTCGGGCGTCATCAGACCTGGAGGTAAGTCCCTG 
GGAGCTCGGCTACTCTTCC 


(CTT) 16 


195 


58- 


-50 


HEX 


JX075892 


GC29 


F 

R 


CAGTCGGGCGTCATCAACTGCGCTTGTAGAAATCTG 
GTTTGTGATTTTGACGGTGGGC 


(AGG) 7 


154 


58- 


-50 


HEX 


JX075893 


GC31 


F 

R 


CAGTCGGGCGTCATCAGTGGTTGGGGTGTGTGAAC 
GTTTC G AAAG AAC C G AAC C GG AC 


(GAGT) 4 


244 


65- 


-58 


FAM 


JX075894 


GC33 b 


F 
R 


CAGTCGGGCGTCATCACCGGGAATGGCTAGTACG 
GTTTGGATGCCTAAGCTGTCCAAG 


(GTTT) 5 


188 


58- 


-50 


FAM 


KC433597 


GC35 


F 
R 


CAGTCGGGCGTCATCACTCTCTTCTCTCGGCCACC 
GTTTCGAACGAGGGGCATTTTCG 


(ATT) 9 


223 


58- 


-50 


HEX 


KC429677 


GC36 b 


F 
R 


CAGTCGGGCGTCATCACTTGCTCTGGTCAGTCTTGG 
GTTTGGGAGGATAGGGAATCTGCTG 


(CTT) 14 


268 


58- 


-50 


HEX 


KC433598 


GC38 


F 
R 


CAGTCGGGCGTCATCAGCTAGGATCAGCAGTCCCG 
GTTTGCTCAATGTCTCGCAGG 


(GCCT) 5 


150 


58- 


-50 


FAM 


JX075895 


GC39 


F 
R 


CAGTCGGGCGTCATCAGCTCGTGAGTTCATTATGTTTGC 
GTTTCAATCCAGCCACCTTTCGC 


(GTTT) 6 


242 


58- 


-50 


NED 


JX075896 


GC47 


F 
R 


CAGTCGGGCGTCATCATGGAGTCTCTCGCAACAC 
GTTTCAACTCAGGCTCTGCTCC 


(ACAT) 6 


204 


58- 


-50 


FAM 


JX075897 



Note: T a = range of annealing temperatures used in touchdown PCR. 
a Fluorescent dye used for fragment analysis. 
b Monomorphic locus. 
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